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Abstract 

Background. Dynamical models of gene regulatory networks (GRNs) are 
highly effective in describing complex biological phenomena and processes, 
such as cell differentiation and cancer development. Yet, the topological 
and functional characterization of real GRNs is often still partial and an 
exhaustive picture of their functioning is missing. 

Motivation. We here introduce CABeRNET, a Cytoscape app for the 
generation, simulation and analysis of Boolean models of GRNs, specifi¬ 
cally focused on their augmentation when a only partial topological and 
functional characterization of the network is available. By generating 
large ensembles of networks in which user-defined entities and relations 
are added to the original core, CABeRNET allows to formulate hypothe¬ 
ses on the missing portions of real networks, as well to investigate their 
generic properties, in the spirit of complexity science. 

Results. CABeRNET offers a series of innovative simulation and mod¬ 
eling functions and tools, including (but not being limited to) the dynam¬ 
ical characterization of the gene activation patterns ruling cell types and 
differentiation fates, and sophisticated robustness assessments, as in the 
case of gene knockouts. The integration within the widely used Cytoscape 
framework for the visualization and analysis of biological networks, makes 
CABeRNET a new essential instrument for both the bioinformatician 
and the computational biologist, as well as a computational support for 
the experimentalist. An example application concerning the analysis of 
an augmented T-helper cell CRN is provided. 


Introduction 

Consistently with the increasing availability of hig data regarding biological 
systems, is the need of mathematical and computational models aimed at their 
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effective analysis and interpretation [KitOl Kit02 . To this end, in the spirit 
of statistical physics and complex systems science, even simplified and abstract 
models have proven effective in representing complex biological phenomena, with 
specific focus on the emergent dynamical behaviours and the so-called generic 
or universal properties |Kau93 Kan06 


In the context of genomics data, one of the best examples is that provided 
by Boolean models of gene regulatory networks (GRNs), which have repeatedly 
proved fruitful in describing key properties of real systems, as well as in provid¬ 
ing cues and hints for wet-lab experiments (see, e.g., Kau95 SDKZ02||SDZ02 


KPST03|[SV^|KPST04||SKA05||MYH0^|SVGK07[1SVG+08t|CQL10| ). The 

simulation of partially characterized regulatory architectures with a Boolean ap¬ 
proach, in particular, has recently gained attention (see, e.g., SvHT97|AMK~*~9¥^ 


STOT||A003MLLL+04||GAES+06|FNGT06|KSRL+06||DB08| ). A first motivation 


lies in the inherently “dynamical” nature of gene (de)regulation processes, and 
in the clear limitations of a “static” analysis capturing only a partial picture of 
such complex processes. For example, a structural analysis of the genomic in¬ 
teractions might preclude to determine the influence of a target-selective therapy 
on the overall GRN interplay ruling tumorigenesis. 

Moreover, as compared to continuous models, such as ODF-based or stochas¬ 
tic models (see |KS08 for a review on GRN modeling), the Boolean abstrac¬ 


tion allows for a clear and effective characterization of the gene activation pat¬ 
terns (or attractors in the terminology of dynamical systems) characterizing 
the different phenotypic functions, such as cell types, modes and fates, un¬ 


der the metaphor of “emergent collective behaviours” |Kau93| HIOO HEBYI05 

FN06 

RK07 

HEK09 CSKJIO FCP+10 

FK12 

CYA13 

NWV14 . In this con- 


text, phenomena such as tumorigenesis can be explained as rare emergent pat¬ 
tern, triggered by signals, stochastic fluctuations and biological noise (see, e.g.. 


EEIO Tsil4 ). 


In line with this approach, we here continue on a recent research strand 
that has involved the development of simulation and analysis tools for dynam¬ 
ical Boolean GRNs (see, e.g., ABG+13 ). The foundations of our theoretical 


framework lay in the seminal work by Stuart Kauffman on Random Boolean Net¬ 
works (RBNs) |Kau69b[|Kau69a| and, more recently, on the dynamical model of 
cell differentiation introduced in SVB+10 VBSll and based on Noisy Random 


Boolean Networks (NRBNs). In this theoretical framework, cell types are asso¬ 
ciated to dynamical gene activation patterns and differentiation to cell-specific 
noise-resistance mechanisms, the underlying hypothesis being that such mech¬ 
anisms refine along with differentiation stages (see HGH+08 and references 
therein). In the Background Section the main features of this approach are 
outlined. 


In this paper we introduce CABeRNET, a Gytoscape |SMO~*~03 app for 
the generation, import, simulation and analysis of Boolean models of GRNs. 
With respect to similar approaches, e.g., |LK11 or the earlier work by our 
group ABG+13 , CABeRNET also allows to augment partially-characterized 


GRNs by adding user-defined entities and relations. The underlying motivation 
is that, despite the increasing knowledge on gene regulation in real organisms. 


2 
























































































the topological and functional characterization of real networks is stil far from 
being comprehensive, thus preventing to capture the complexity of the overall 
interplay: CABeRNET is conceived to randomly generate the missing portions 
of the partially-characterized GRNs, in order to obtain ensembles of augmented 
GRNs that share the common core and other structural and functional param¬ 
eters, hence allowing to test hypotheses on the yet unknown missing portions of 
real networks. Besides, in line with the complex systems approach, the generic 
properties of such ensembles can be inferred and investigated, possibly pro¬ 
viding an ensemble-level understanding of yet unraveled GRN phenomena. In 
addition, CABeRNET guides the user through various robustness analyses of 
the GRNs, which can be matched against genomic experimental data. Notice 
that CABeRNET can also generate completely random GRNs with defined 
structural and functional constraints, as well as simulate the dynamics of com¬ 
pletely characterized GRNs. In the Implementation Section the main features 
of CABeRNET are described. 

As a proof of principle, in the Results Section we present the augmentation 
of the T-helper cell signaling network and describe the analysis of its dynamics, 
with particular focus on the emergent differentiation scheme and its robustness 
against the knockout of specific genes. Conclusions are presented in the last 
Section. 


Background: the GRN model 


In CABeRNET, GRNs are represented as Noisy Random Boolean Networks 
VBSll . A classical RBN is a graph composed of n Boolean nodes associated 
with binary variables at € {0,1}, representing the activation of a gene: if (7^ = 1 
the i—th gene synthesizes its product (i.e., proteins or RNAs), otherwise it 
does not. Each node is connected to ki < n — 1 nodes which represent its 
regulatory inputs, i.e., those genes influencing the activation of the z—th gene. 
The way in which regulation takes place is via a node-specific Boolean function 
fi of the regulatory inputs: the value of cr^ at time t -I- 1 is determined as 
ai{t + 1) = (t), cTjv, (t),..., (Tjj, (t)). The dynamics of canonical RBNs is 

synchronous, deterministic and follows discrete time steps. Thus, it eventually 
ends up in (at least unitary) state cycles, termed attractors, which represent 
the emerging gene activation patterns displayed by the GRN. Remarkably, by 
varying the structural features of the networks, different dynamical regimes 
appear, ranging from ordered to disordered behaviors (see |Kau93| ). 


In VBSll 


Villani et al. introduced a form of stochasticity in RBNs, hence 
speaking of “Noisy RBNs”. NRBNs allow for the transitions among attractors as 
a consequence of random “flips” of aiS value, i.e., random modifications of the 
node’s activation value lasting a defined time span. Thus, with this approach 
it is possible to determine the so-called Attractors Transition Network (ATN, 
or matrix, ATM), i.e., a stability matrix displaying the probability of a gene 
activation pattern to emerge in response to a perturbation in another pattern 
(noise-induced transition). A noise threshold is then introduced to exclude tran- 
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sitions unlikely to occur for a cell in a certain differentiation stage. Therefore, 
multiple thresholds allow to identify distinct threshold-dependent ATNs, each 
one representing the stability of the CRN’s activation patterns characterizing 
a specific differentiation stage (e.g., toti-/multi-potents stem cells, progenitors, 
differentiated cells, etc). The exact mathematical definition of such patterns is 
given as Threshold Ergodic Sets (TESs), which: i) are strongly connected com¬ 
ponents in the threshold-dependent ATN, ii) and have no outward transitions 
outside the components’ set. These structures are hierarchical, thus can be 
naturally mapped to cell types yielding to emergent differentiation trees. Such 
trees mimic the metaphor that less differentiated stages have less refined noise- 
control mechanisms, resulting hence in TES associated to lower thresholds (see, 
e.g., HKG+Q? HLS08 FK09 ). In general, the number of patterns observable 


in a certain cell type decreases progressively along with the differentiation stage 
(see Fig. [^for a simplified representation of the model). 

Notice that the approach is general, it does not refer to any specific organism, 
and has been shown to be suitable to describe i) different degrees of differen¬ 
tiation, i.e., from toti-/multi-potent stem cells to intermediate states, to fully 
differentiated cells; ii) the stochastic differentiation process, in which a popu¬ 
lation of toti-/multi-potent cells stochastically generates progenies of distinct 
types; and iii) the induced pluripotency phenomenon, according to which fully 
differentiated cells can revert to a pluripotent stage through the perturbation 
of some key genes |Yam09 . 


Implementation 

CABeRNET is a Java tool developed as Cytoscape’s version 3.x application; 
see the Availability Section for information about download and installation of 
the tool. CABeRNET sessions are user-defined batch computations. Parame¬ 
ters are specified by a step-by-step wizard and various post-simulation functions 
are accessible directly from the CABeRNET menu in the Cytoscape active win¬ 
dow. The tool implements a wide range of simulation and analysis functions, 
which can be summarily listed as follows (see Fig. |^for a summary of functions 
and parameters, and the user manual, on the plugin website, for a detailed 
description). Instructions on obtaining and using the code used in the paper 
are available on the page website. Tutorials include a package to reproduce the 
example application discussed in the paper (see Availability). 


Input GRNs (generation, import and augmentation) 

[Random network generation] CABeRNET can randomly generate and 
simulate ensembles of CRNs with certain structural parameters such as: (i) 
number of genes; (ii) ingoing/out going CRN’s topology, e.g., fixed, Erdds- 
Renyi’s rand om |ER59j , Barabasi-Alberts’ scale-free BA99| or Watts-Strogatz’s 
small-world WS98j ; {iii) type of regulation functions (node-specific Boolean 
functions). Concerning the latter, these can be set by the user or randomly gen- 
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Figure 1: Simplified representation of the GRN model in CABeRNET. (a) Exam¬ 
ple RBN with 3 genes (nodes) and edges representing regulatory interactions (via node-specific 
Boolean functions, not shown), (b) Example dynamics to highlight network’s attractors, mod¬ 
eling gene activation patterns Ai, . .., Ag, and possible transitions among them induced by 
noise (i.e., single flips), (c) The transitions yield an Attractor Transition Network that gener¬ 
ates 5 cellular types when 3 thresholds, <5i, i = 1, 2, 3 are evaluated to asses the corresponding 
Threshold Ergodic Sets. In this approach, where the efficiency of noise-control mechanisms 
is related to differentiation types, stem cells (pink), intermediate stages (light blue) and fully 
differentiated cells (yellow, purp le and grey) emerge. The corresponding differentiation tree, 
is shown (Figure modified from |GCDMA14|). 


erated to accomodate: (i) bias-based random functions, (ii) canalyzing KPST04 
or (in) logical functions - expressed in the canonical AND/OR notation. Net¬ 
work and simulation parameters (e.g., samples size) can be defined either via 
an input form or a textual file. 


[Import of characterized GRNs] GRNs that are characterized with respect 
to both the topology and the regulatory functions, either fetched from public 
datasets via Cytoscape or loaded via input textual file, can be processed with 
CABeRNET. 


[Augmentation of GRNs] Any GRN loaded in the tool (see above) can be 
augmented by CABeRNET, by randomly generating a chosen number of aug- 
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Input GRNs [generation, loading and augmentation) 


Network generation type 

Parameter/Input 

Values/Notes 

Random Network Generation 

Number of nodes/edges 


Ingoing/outgoing topology 

Fixed inputs/outputs 

Erdos-Renyi's random 

Barabasi-Alberts' scale-free 

Watts-Strogatz's small-world 

Type and proportion of updating Boolean 
functions 

Bias-based random functions 

Canalyzinq 

Logical functions (AND/OR) 

Import of characterized GRNs 

Original network 

Input via qrnml file or via Cytoscape active window 

Augmentation of GRNs 

Original network 

Input via qrnml file or via Cytoscape active window 

Number of nodes/edges 

Including the original core 

Constraints on source/target nodes 


Type and proportion of updating Boolean 
functions 

Bias-based random functions 

Canalyzing 

Logical functions (AND/OR) 

CRN's d] 

^namics simulation 

GRN-selection constrained by differentiation 

Parameter/Input 

Values/Notes 


Parameter/Input 

Values/Notes 

Attractors search 

Exhaustive 

Differentiation tree 

Input via text file or via 
Cytoscape active window 

Sampling-based 

Tree matching criterion 

Perfect match 

Type and features of the 
perturbations 

Temporary node-flips (with specific features) 

Minimum distance (Equation 

1) 

Permanent node knock-in/knock-out (with 
specific features) 


VisualizaHon 

Robustness analysis 

Parameter/Input 

Values/Notes 


Parameter/lnput 

Values/Notes 

Network Views (in Cytoscope 
active window) 

NRBNs 

Type and features of the 
perturbations 

Temporary node-flips (with 
specific features) 

Attractor graph network 

Permanent node knock- 
in/knock-out (with specific 
features] 

Threshold dependent ATNs 

Perturbed node selection 

Random 

Representative differentiation tree [when 
computed) 

Specific nodes 

Network Styles 

CABeRNET network 

Perturbed network 

User-selected 

CABeRNET attractors 

All the networks 

CABeRNET TES 


CABeRNET collapsed TES 

Network analysis 

Outputs 

Statistics 

Notes 


Output type 

Notes 

Distribtution of the number of 

attractors 

Computed on all networks 

Network structural 
properties (topology and 
functions) 

Both in grnmf and sif format 

Distribution of the attractors' 
lengths 

Either for one or all networks 

Attractors transition matrix 

In CSV format 

Basins of attraction 

Attractors 

In CSV format 

Proportion of frozen and 
oscillating nodes 

For each network 

Node configurations of 
each attractor 

In CSV format 

Clustering coefficient 

Attractor lengths 

In CSV format, unique file for 
all the networks. 

Network diameter 

Basin of attraction 

In CSV format, unique file for 
all the networks 

Average path length 

Network Synthesis file 

In CSV format 


Figure 2: Main functionalities and parameters of CABeRNET. A schematic repre¬ 
sentation of the various functions and parameters of CABeRNET is provided, as explicitly 
described in the Implementation Section in the main text. For a thorough explanation please 
refer to the user manual (see Availability). 


mented networks, in which entities and relations are added to the input network 
according to user-defined structural and functional parameters. Parameters for 
augmentation are the same that must be defined for the random generation 
(see above). The resulting ensemble of augmented networks will share the in¬ 
put topological and functional core and will differ for the randomly generated 
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portion. 

For instance, in the Results Section, the T-helper cell signaling network cu¬ 
rated from KSRL+06 (40 genes and 51 regulatory interactions), is augmented 
with 160 further nodes and 349 edges according to a Erdos-Renyi random topol¬ 
ogy. Assessment of the functional effect of this augmented network is also dis¬ 
cussed. 


CRN’s dynamics simulation 

Given that the space of the possible configurations of a NRBN can be dramat¬ 
ically large (there are 2^ possible configurations for a network with N nodes), 
CABeRNET can simulate the dynamics of a network by either: (i) sampling 
the initial conditions to test or (ii) performing an exhaustive search (for small 
networks only). CABeRNET allows to investigate key statistics of the emerg¬ 
ing attractors such as, e.g., number, length, robustness and reachability. The 
stability of any pattern to perturbations can be assessed either via temporary 
flips (with duration of 1 step) or via permanent gene knock-in/knock-out. 

Following the simulation, CABeRNET can compute and display the threshold- 
dependent ATN (here called the TES network) for specific threshold values. 
Different views on such a network are available in the tool and, for instance, 
allow to display the genes’ configurations in a pattern, and the variation of the 
number of TESs alongside thresholds, as proposed in GDP+14 . 


GRN-selection constrained by differentiation scheme 

One might look for a CRN giving rise to a specific differentiation tree, as done 
e.g. in |GCDMA14 . Trees can be inputed to CABeRNET in textual format or 
from the Cytoscape active window; CABeRNET can select those NRBNs that 
display a differentiation tree structurally similar to the loaded one, where the 
measure of similarity is defined by the user. This feature is implemented as a 
batch process scanning among generated NRBNs. Notice that, usually, a single 
NRBN exhibits various emerging trees, according to the various possible combi¬ 
nations of thresholds thus this feature is the computationally most demanding 
in CABeRNET. 

Besides, the statistically representative differentiation tree(s) of each specific 
network, defined as the most frequent emerging tree for different (sampled) 
threshold combinations, provided a specific tree depth, can be computed. 


Visualization 

Each computational task is tracked by a progress bar. Once the simulation of 
the dynamics is completed, the powerful visualization capabilities of Cytoscape 
can be used to analyze the topological and dynamical properties of the networks. 

In particular, with CABeRNET it is possible to visualize: (i) the NRBNs, 
(ii) the attractor graph network, in which all the states of the attractors and the 
transitions among them are displayed, (Hi) the threshold dependent ATNs and 
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(iv) the representative differentiation tree. By clicking on a specific network, it 
gets visualized within Cytoscape, so that it can be further analyzed. 

Different network styles have been defined and can be selected: {i) CABeR- 
NET network, aimed at visualizing the properties of the NRBN: the color of 
each node being related to the Boolean function bias and the size of each node 
proportional to its degree, (ii) CABeRNET attractors, for the visualization of 
the attractor graph network, (in) CABeRNET TES, for the visualization of 
the ATN and {iv) CABeRNET collapsed TES, for the collapsed visualization of 
the ATN: in these last two styles, the edge size is proportional to the transition 
probability. 


Robustness analysis 


Different kind of perturbations can be applied to a simulated network. In par¬ 
ticular, it is possible to perform a user-defined number of (i) temporary (i.e., 
flips) or (ii) permanent (i.e., knock-in/knock-out) perturbations on (i) a chosen 
number of randomly selected nodes or (ii) specific nodes. The robustness anal¬ 
ysis can be performed on single networks or on the whole ensemble of simulated 
GRNs. 

Network’s stability is assessed via robustness analyses, by means of standard 
measures such as avalanches (i.e., the number of nodes whose activation pattern 
is different in a perturbation experiment with respect to the wild type scenario) 
and sensitivity (i.e., the number of perturbation experiments in which a certain 
node’s pattern is affected) SVGK07||GSV~*~llb| . The results of the analyses can 
be exported in csv files. 


Network analysis 

CABeRNET offers a wide range of network-specific statistics. These include {i) 
the distribution of the attractors’ lengths, {ii) the basins of attraction, {Hi) the 
proportion of frozen and oscillating nodes, plus other classical network measures 
such as clustering coejficient, network diameter and average path length. All 
the statistics can be visualized and exported; further network measures are 
accessible via the network analysis tools included in Cytoscape. 


Outputs 

All the networks and the relative topological, functional and dynamical prop¬ 
erties can be exported as textual files, from both the Wizard and the Func¬ 
tion menu. For instance, the complete topological and functional description 
of the networks can be exported so that it can be used in simulation envi¬ 
ronments external to Cytoscape such as, e.g., CHASTE |PFPB~*~09 , Compu- 


CellSD STB+12 or the simulator described in GCDMA14 (see |DMGAT3 


for a recent review on multiscale models of multicellular systems). Also, it is 
possible to export the complete description of all the attractor states, as well as 
information of their basins on attraction. 














Results 


Our group has recently been focnsing on the investigation of the dynamical 
properties of mnlticellular systems via multiscale simulations, with particular 
attention to the conditions that wonld favor the emergence and development of 
tumors. To this end, CABeRNET was recently nsed to generate, simnlate and 
visualize the GRNs ruling the behaviour of an intestinal crypt in CHASTE’s 
mnltiscale simnlation engine, allowing to identify conditions for cancer’s emer¬ 
gence and crypt’s colonization RGC+15 . In the following, we propose a farther 


example to show some of the potential applications of CABeRNET. 


[Augmentation of T-helper signaling network] The signaling network 
of human T-helper cells was recently characterized with respect to both the 
topology and the regnlatory fnnctions. In |MX06j the dynamics of snch network 
was simnlated with a Boolean approach and it was shown that the attractors 
actnally reproduce real gene activation patterns of distinctly differentiated T- 
helper cell types. With CABeRNET the same dynamical analysis could have 
been easily performed. 

In the following, we present an experiment of CRN’s angmentation possible 
in CABeRNET. To the best of onr knowledge, no experiments of this sort are 
possible with other RBN-based tools. The final goal is to: (?) generate a large 
ensemble of random networks with the T-helper fnnctional core, and (ii) select 
only those networks in which the emergent dynamical behaviour is in accordance 
with the hematopoietic differentiation tree, in which the T-helper cell type is 
supposed to be one of the leaves (see Fig.[^. 

The distinct networks differ for the angmented portion, which is randomly 
generated with structural parameters (i.e., Erdos-Renyi random topology, av¬ 
erage connectivity = 2, random Boolean functions with bias = 0.5) that are 
classically used in similar stndies (see, e.g., SVGK07 GSV~*~11^ ). Accordingly, 
only certain networks will eventnally display the desired emergent dynamical 
behavior. The underlying idea is that the matching networks might allow for 
the formnlation of hypotheses on the missing portions of the relevant GRN 
ruling the overall hematopoiesis process. Besides, the characterization of the 
attractors could be matched with the real gene activation patterns driving the 
functioning of the varions cell types. 

In the experiment, a large number of distinct angmented networks was gen¬ 
erated and simnlated with CABeRNET: only 1 on a total of 600 augmented 
NRBNs actually displayed the expected dynamical behavior, i.e., the correct 
differentiation tree, hinting at the complexity of the tnning process driven by 
the evolntionary pressnre that led to the topology of cnrrent GRNs. 

In Fig. |3] one can see the original T-helper signaling network, originally 


mapped in KSRL+06 , and the NRBN that was selected as correct, in which 


the angmented portion is highlighted. Notice that, in the angmented network, 
both the topology and the functions of the original core are slightly different from 
those of the T-helper GRN, as a consequence of adding new relations linking 
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the new and the original portions of the net. The visualization of the network 
is provided via CABeRNET, by applying the suitable styles (see above). 

In Fig. one can notice that this specific network exhibits 8 distinct at¬ 
tractors (each one characterized by a length equal to 8 NRBN time steps). By 
pruning the ATN with increasingly larger thresholds, the TES at the higher 
level, including all the 8 attractors connected by noise-induced transitions and 
representing multi-/toti-potent cells, progressively splits in TESs enclosing an 
increasingly lower number of attractors, up to the 7 TESs at the lower level, 
which correspond to single attractors, when the threshold is equal to 1. 

The resulting emergent differentiation tree perfectly matches that of hematopoi¬ 
etic cells (taken from LIYT+13 , see Fig. [^, which is characterized by a multi- 
potent progenitor {MPP\ antecedent hematopoietic stem cells, HSCs, are not 
shown in the scheme), with the potential to differentiate into two lineages, i.e., 
common myeloid progenitor {CMP) and common lymphoid progenitor {CLP). 
CMP further divide into megakaryocyte-erythroid progenitor {MEP) and gran¬ 
ulocyte/monocyte progenitor {CMP), finally committing to mature blood cells 
including erythrocytes {EC), megakaryocyte {MK), monocyte (M) and gran¬ 
ulocytes, i.e. neutrophils, eosinophils, basophils {N/E/B). Conversely, CLP 
further differentiate into B-cell progenitors {B PROC) and T-cell and natural 
killer cell progenitors {T/NK PROC), with a final commitment to mature B 
cells {B), T cells (T) and NK cells {NK). 

Hence, it is possible to hypothesize that this specifically selected NRBN 
might present topological and functional properties ensuring the correct emer¬ 
gent differentiation scheme. We remark that, in case augmented networks were 
more likely to display a matching emergent tree, one may exploit CABeRNET 
to perform ensemble-level analyses on the matching set, aimed at the formula¬ 
tion of hypotheses on the generic properties of real networks. 

A robustness analysis on the matching NRBN was also performed. By sim¬ 
ulating selective single knockouts of the genes in the original T-helper core, 
we can assess the distinctive relevance in maintaining the correct differentiation 
scheme. In this example, we performed 40 single knockout experiments (KO), by 
forcing the specific Boolean function of each gene to inactivation (i.e., 0 output 
for any regulatory input), and we tried to match the resulting differentiation 
trees with that of hematopoietic cells. Remarkably, in 35 cases (88%), the KO 
experiment resulted in a mismatching tree, hinting at the role of those specific 
genes in the interplay leading to the emergence of the hematopoietic tree. We 
measured the similarity among the hematopoietic tree, h, and a tree T resulting 
from a KO experiment as follows: 


r k* 

= EE I nh{k,l) - nT{k,l) \ 

1=0 k=0 

where I* and k* are the maximum depth and the maximum number of a node’s 
children in both h and T. Function nx{k, 1) returns the number of nodes at level 
I with k children in tree x; thus, this quantity measures the structural level-by¬ 
level similarity of two trees by assessing the number of parents with k children. 
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per level. Since we focus on differentiation trees, this can be interpreted as a 
measure of the ability of a certain cell type, a progenitor, to differentiate in a 
set of distinct subtypes. 

In Fig.j^we show values of d in our experiments; the lower the value the closer 
is T to h. Values of d range around 9, with a maximum of 17 and minimum 
0; in 8 cases, the value lower than 5 suggests a close similarity between the 
emergent and the hematopoietic tree. Besides, the dynamics turned out to be 
completely insensitive (i.e., d = 0) to the KO of 5 specific genes, i.e., CRE, 
Ca, PLCg-a, Fos and Gads, which, accordingly, might be not relevant in the 
differentiation process. Clearly, further investigations are needed to corroborate 
this hypothesis. 

We finally remark that the generated networks could be used within any 
multiscale simulation frameworks, in order to investigate, e.g., the processes of 


homeostasis and clonal expansion, as proposed in GCDMA14 RGC+15 


Conclusions 

In this work we introduced CABeRNET - a new Cytoscape app for the gener¬ 
ation, simulation and analysis of augmented Boolean models of gene regulatory 
networks - and described some of its key functionalities, as well as an example 
application to real GRN data. 

CABeRNET is the final result of a long-time effort aimed at bridging dif¬ 
ferent fields and disciplines, such as computer science, statistics and complex 
systems science, for the effective study of complex biological systems. The nu¬ 
merous modeling and simulation functionalities, the various effective analysis 
tools and the fine integration within the widely used Cytoscape framework, 
might settle the ground for CABeRNET becoming a powerful instrument for 
bioinformaticians and computational biologists, especially in providing a com¬ 
putational support for experimentalists. 

In particular, CABeRNET can provide an essential tool to effectively in¬ 
vestigate key and still partially undeciphered biological phenomena, such as, 
e.g., gene regulation, cell differentiation and tumorigenesis, with particular fo¬ 
cus on the properties of dynamical gene activation patterns and their relation 
with biological noise. 

Availability and requirements 

Project name: CABeRNET: a Cytoscape app for the generation and the 
Analysis of Boolean models of gene Regulatory NETworks 

Version: 1.0 

Plugin website: http://bimib.disco.unimib.it/index.php/CABERNET 
Operating systems: platform independent 

Software requirement: Cytoscape 3.x (http://www.cytoscape.org/) 
Programming language: Java 
License: BSD-like license (see website) 
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Figure 3: Dynamical simulation and robustness analysis of an augmented T- 
helper GR N with CABeRNET. (A) The T-helper signaling network, mapped in 
[KSRL~*~06| . Edges stand for regulatory interactions, either activating (black) or inhibiting 
(red). The network is composed by 40 genes and 51 interactions. (B) The augmented NRNB 
that displayed a differentiation tree matching the hematopoietic one. To find it, 600 NRBNs 
were randomly generated by augmenting the T-helper GRN in CABeRNET; the augmented 
networks include 200 nodes (160 nodes added to the original core) and 400 edges (349 new ones, 
average connectivity = 2). The nodes are wired according to a random Erdos-Renyi topology, 
and random Boolean functions with bias = 0.5 are associated to the nodes. Only matching 
NRBN is shown, the original core and the augmented portion of which are highlighted. In 
CABeRNET’s visualization the size of each node is proportional to its connectivity degree 
and t he color-sc ale to the function bias. (C) The differentiation tree of hematopoietic cells 
from [LIYT~*~13] is depicted. Notice that T-helper cell type represents one of the leaves of 
the tree. For the description of the acronyms please refer to the main text. (D) The At¬ 
tractor Transition Matrix of the matching NRBN is plot by CABeRNET, highlighting the 
noise-induced transitions among attractors and the Threshold Ergodic Sets representing cell 
types. The progressive splitting of the TESs ch^ to increasingly larger noise resistance-related 
thresholds (i.e., 5 = 0,0.023,0.056,1) is shown, stressing the perfect matching between the 
emergent differentiation tree and that of hematopoietic cells, from multi-potent cells to fully 
differentiated cell types. (E) Configuration of the 8 attractors (determining the gene activa¬ 
tion patterns). In this specific case, the length of each attractor is equal to 8. (F) Robustness 
analysis performed via CABeRNET. Single node knockout experiments (i.e., silencing the 
node’s Boolean function) are performed on each node of the original core of the augmented 
network and the dynamics is simulated again via CABeRNET. The emergent tree is then 
compared with that of hematopoietic cells and the distribution of the similarity measure d 
(Equation Q is displayed, highlighting 5 genes that, when silenced, still lead to a matching 
emere'ent tree Ti-e.. d = Db 
















































